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UTILISATION OF CONSTRUCTS COMPRISING RECOMBINATION SEQUENCE 
MOTIFS FOR ENHANCING GENE EXPRESSION IN MOSS 
This is a National Phase Application in the United States of 
International Patent Application No. PCT/EP2004/008521 filed July 
29, 2004, which claims priority on European Patent Application 
No. 03017343.9, filed July 31, 2003. The entire disclosures of 
the above patent applications are hereby incorporated by 
reference . 

This application incorporates by reference the Sequence 
Listing on the compact disc, namely the file "Sequence- 
listing. APP" created on January 26, 2006 with a size of 4 
kilobytes . 

Background of the Invention 

The present invention relates to methods and materials for 
improving gene expression in eucaryotic cells, particularly in 
plant cells comprised in mosses, such as moss protonema cells. 



Related Art 



Gene amplification for improving the expression of recombinant 
proteins in mammalian cell cultures is a generally used strategy 
(Herlitschka et al . (1996) Protein Expr. Purif. 8, 358-364; 
Ringold et al . (1981) J. Mol . Appl . ) . 

In plants, effecting gene amplification strategies is problematic 
due to silencing events that can be triggered by multi-copy 
integrations of heterologous DNA (Asaad et al. (1993) Plant Mol 
Biol. 22, 1067-1085) . Recently, strategies for gene amplification 
in plants have been developed to overcome these limitations. The 
cis-acting genetic element aps was isolated from a non- 
transcribed spacer region of tobacco ribosomal DNA. This spacer 
element was fused to reporter genes of interest and resulted in 
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in higher expression levels of heterologous proteins therefrom 
(Borisjuk et al . (2000) Nature Biotechnol. 18, 1303-1306). 



A further strategy has been described by Klimyuk et al . which 
5 involved the expression of heterologous proteins via trans- 
splicing (WO 02/097080) . 

To date, little is known about the correlation of copy number and 
heterologous gene expression in transgenic moss plants. The use 

10 of mosses for the production of recombinant proteins is a well- 
established technology (EP1206561, Gorr et al . 2001, Naunyn- 
Schmiedeberg' s Arch. Pharmacol. 363 Suppl . : R 85). Typically, 
anything from 1 to about 50 copies of the transforming plasmid 
may be integrated into the genome of transformed moss tissue 

15 (Schaefer (2002) Annu. Rev. Plant Biol. 53, 477-501). Depending 
on the design of the transforming constructs employed, homologous 
recombination, that is, a targeted integration event and/or 
heterologous recombination, that is, a random or non-targeted 
integration event can occur. Thus, by using DNA sequences (i.e. 

20 comprised of coding or non-coding sequences ) for transformation 
which are homologous to genomic DNA sequences of a moss can 
result in one or more homologous recombination events via 
integration of the introduced or transforming DNA into the 
genomic locus of the homologous DNA. Use of DNA sequences (i.e. 

25 comprised of coding or non-coding sequences) for transformation 
that lack any appreciable homology to a genomic DNA sequence of 
a moss can result in one or more heterologous recombination 
events via integration of the introduced DNA randomly into the 
genome. Moss is the only known plant system which displays a high 

30 frequency of homologous recombination (Strepp et al . (1998) Proc . 
Natl. Acad. Scl . USA 95, 4368-4373; Schaefer (2002) Annu. Rev. 
Plant Biol. 53, 477-501). This apparently unique attribute of 
mosses has been used for the targeted introduction of genes. 
However, the amplification of gene expression by increasing the 
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copy number of plasmids of interest in order to generate greater 
levels of protein per unit mass of stably transformed moss tissue 
has not hitherto been described. 

5 Surprisingly, it has been found that by transforming, typically 
co-transforming cells (protoplasts) of moss tissue with at least 
two heterologous nucleic acid sequences comprising at least one 
set of recombination sequences results in an increase in the 
integrated copy number of heterologous nucleic acid constructs in 
10 regenerated tissue, such as cells comprised in moss protonema, 

which in turn is correlated with an increase of protein 
expression levels . 

It is therefore an object of the invention to provide an improved 
15 method for the production of proteins of interest in cells 
comprised in moss tissue. 

Brief Summary of -the Invention 

20 According to the present invention there is provided a method of 
amplifying gene expression in a moss plant cell comprising 

1) providing at least a first heterologous nucleic acid 

construct comprising at least one heterologous nucleotide 
25 sequence operably linked to a promoter, wherein the said 

construct is flanked at the 5' end thereof by a first 
recombination sequence and is flanked at the 3 ? end of the 
said construct by a second recombination sequence in the 
same orientation as the first; 

30 

2) providing at least a second heterologous nucleic acid 

construct comprising at least one heterologous nucleotide 
sequence operably linked to a promoter, wherein the said 
construct is flanked at the 5 1 end thereof by said second 
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recombination sequence and is flanked at the 3 f end of the 
said construct by said first recombination sequence in the 
same orientation as the second; and 

5 3) transforming into the moss plant cell at least said first 

and said second heterologous nucleic acid construct. 

The skilled addressee will appreciate that once the said at least 
two heterologous constructs are transformed into the moss plant 

10 cell, such as a moss protoplast, for example a Physcomitrella 

patens protoplast, which is then permitted to regenerate into 
moss protonema, for example of Physcomitrella patens, they will 
undergo recombination with each other many times over. This 
process, once initiated in the moss plant cell, increases the 

15 copy number of integrated transforming DNA constructs of the 
invention therein. 

Thus, as a further aspect of the invention there is provided a 
moss protonema, preferably protonema of Physcomitrella patens, 
20 comprised of cells stably transformed, more preferably- co- 
transformed with at least two complementary constructs of the 
invention . 

Ultimately, significant increases in the level of heterologous 
25 protein of interest from the at least one heterologous gene of 
interest are measurable over and above the levels of protein that 
are measurable in moss protonema cells from conventional 
transforming constructs lacking the features of constructs of the 
invention. The at least first and the at least second 
30 recombination sequences form a complementary set that make it 
possible for the constructs of the invention to recombine with 
each other. Naturally, the skilled addressee will appreciate that 
constructs of the invention may be employed in which one or more 
complementary sets of recombination sequences may be used 
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depending on how many of the same or different nucleotide 
sequences of interest are intended to be utilised for protein 
production, such as 1, 2, 3, 4, or 5 or more sets. Preferably a 
single- complementary set of recombination sequences is used for 
5 ease of convenience. 

In a further aspect of the invention there is provided a 
heterologous DNA construct of the invention that comprises in the 
5 1 to 3 1 direction: 

1) an introduced first recombination sequence; 

2) at least a heterologous nucleic acid sequence of interest 
comprising a promoter operably linked thereto and 
optionally a terminator therefor; and 

3) an introduced second recombination sequence. 

In a further aspect of the invention there is provided a 
heterologous DNA construct of the invention that comprises in the 
5 1 to 3' direction: 

1) an introduced second recombination sequence; 

2) at least a heterologous nucleic acid sequence of interest 
comprising a promoter operably linked thereto and optionally a 
terminator therefor; and 

3) an introduced first recombination sequence. 

25 Thus the two constructs comprise similar complementary 
recombination sequences located at different sites therein that 
enable or permit the constructs to recombine with each other in 
situ in transformed moss protonema cells comprised in the moss 
protonema, for example protonema of Physcomitrella patens. 

30 Preferably, the constructs of the invention are in linear form. 

Such constructs may be used to transform moss protoplasts in at 
least two separate transformation events where a first 
transformation event is separated from a second transformation 



10 
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event in time or the constructs of the invention may be co- 
transformed into moss protoplasts which are then permitted or 
allowed to regenerate into moss protonema . Preferably, the 
transformation event comprises co-transforming moss protoplasts 
5 with at least two constructs of the invention as described above. 



The recombination sequence utilised in constructs of the 
invention may be any sequence selected from any organism, such as 
from plant genomic DNA, such as from genomic DNA, cDNA, intron or 

10 exon regions or non-coding regions or any combination thereof, 

for example, from Physcomltrella patens. Suitable genomic DNA for 
use as recombination sequence may comprise DNA from an exon or an 
intron or a hybrid of the two . Preferably the recombination 
sequence is formed of DNA from an intron or non-coding region of 

15 DNA. As discussed herein, the orientation of the two flanking 
recombination sequences is preferably in the same orientation, 
for example, in the 5' to 3 1 direction or in the 3 1 to 5' 
direction in both of the two transforming constructs albeit that 
the actual location of the recombination sequences within the two 

20 constructs is different one from the other as alluded to above. 
Naturally, the skilled addressee will appreciate that the 
heterologous constructs of the invention will comprise 
recombination sequences in appropriate position and orientation 
that enables recombination events to occur between the two. The 

25 recombination nucleotide sequences of constructs of the invention 
can be of any length provided that they are capable of causing or 
permitting recombination events to occur. Suitable lengths for 
the recombination sequences employed in constructs of the 
invention range from 25 - 1000 nucleotides in length or longer; 

30 from 25 - 650 nucleotides in length; from 50 - 650 nucleotides in 
length; from 100 - 400 nucleotides in length; or from 200 - 400 
nucleotides in length, for example, of about 200 +/- 50 
nucleotides in length. The skilled addressee will appreciate that 
the length of the recombination sequences of constructs of the 
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invention may vary depending on design 



As a further aspect of the invention, there is provided a moss 
cell comprised of constructs of the invention, moss protonema 
5 comprised of said moss cells, and/or moss plants comprising 
constructs of the invention, particularly a moss protonema cell, 
moss protonema comprised of protonema cells comprised of 
constructs of the invention, and/or moss plants comprised of 
constructs of the invention that are Physcomitrella patens. 

10 

Particular aspects of the invention will now be discussed in more 
detail . 

Detailed Description of the Invention 

15 

Deflnl tlons 

The term "heterologous" is used broadly below to indicate that 
the gene/sequence of nucleotides in question have been introduced 

20 into moss protoplasts using genetic engineering, i.e. by human 
intervention. A heterologous gene may augment the expression of a 
protein of interest from an endogenous equivalent gene, i.e. one 
which normally performs the same or a similar function, or the 
inserted sequence may be additional to the endogenous gene or 

25 other sequence. Nucleic acid heterologous to a cell may be non- 
naturally occurring in moss protoplasts of that type, variety or 
species. Thus the heterologous nucleic acid may comprise a 
coding sequence of, or derived from, a particular type of 
organism, such as a mammalian species, e.g of human, ovine, 

30 bovine, equine, or porcine species, placed within the context of 
a moss protoplast, such as a protoplast derived from 
Physcomitrella patens. A further possibility is for a nucleic 
acid sequence to be placed within a moss protoplast in which it 
or a homologue is found naturally, but wherein the nucleic acid 
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sequence is linked and/or adjacent to nucleic acid which does not 
occur naturally within the cell, or cells of that type or species 
or variety of plant, such as operably linked to one or more 
regulatory sequences, such as a promoter sequence, for control of 
5 expression. 

"Gene" unless context demands otherwise refers to any nucleic 
acid encoding genetic information for translation into a peptide, 
polypeptide or protein. 

10 

"Vector" is defined to include, inter alia, any plasmid, cosmid, 
phage, or viral vector in double or single stranded linear or 
circular form which may or may not be self transmissible or 
mobilizable, and which can transform a prokaryotic or eukaryotic 

15 host and exists extrachromosomally (e.g. autonomous replicating 
plasmid with an origin of replication) . Specifically included 
are shuttle vectors by which is meant a DNA vehicle capable, 
naturally or by design, of replication in two different host 
organisms, which may be selected from actinomycetes and related 

20 species, bacteria and eucaryotic (e.g. higher plant, mosses, 
mammalian, yeast or fungal) cells. 

"Expression vector" refers to a vector in which a nucleic acid is 
under the control of, and operably linked to, an appropriate 

25 promoter or other regulatory elements for transcription in a host 
cell such as a microbial cell or a moss protoplast. The vector 
may be a bi-f unctional expression vector which functions in 
multiple hosts . In the case of genomic or subgenomic DNA, this 
may contain its own promoter or other regulatory elements and in 

30 the case of cDNA this may be under the control of an appropriate 
promoter or other regulatory elements for expression in the host 
cell . 

A "promoter" is a sequence of nucleotides from which 
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transcription may be initiated of DNA operably linked downstream 
(i.e. in the 3 1 direction on the sense strand of double-stranded 
DNA) . 

5 "Operably linked" means joined as part of the same nucleic acid 
molecule, suitably positioned and oriented for transcription to 
be initiated from the promoter. 

The term "inducible" as applied to a promoter is well understood 
10 by those skilled in the art. In essence, expression under the 
control of an inducible promoter is "switched on" or increased in 
response to an applied stimulus. The nature of the stimulus 
varies between promoters . Some inducible promoters cause little 
or undetectable levels of expression (or no expression) in the 
15 absence of the appropriate stimulus. Other inducible promoters 
cause detectable constitutive expression in the absence of the 
stimulus. Whatever the level of expression is in the absence of 
the stimulus, expression from any inducible promoter is increased 
in the presence of the correct stimulus. 

20 

The invention also embraces use of a variant of any of these 
sequences . A variant protein shares homology with, or is 
identical to, all or part of the sequences discussed above. 
Generally speaking, wherever the term is used herein, variants 
25 may be: 

(i) naturally occurring homologous variants of the relevant 
protein, 

30 (ii) artificially generated homologous variants (derivatives) 
which can be prepared by the skilled person in the light of the 
present disclosure, for instance by site directed or random 
mutagenesis, or by direct synthesis. Preferably the variant 
nucleic acid, encoding the variant polypeptide, is generated 
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either directly or indirectly (e.g. via one or more amplification 
or replication steps) from an original nucleic acid. Changes to 
the nucleic acid sequence may produce a derivative by way of one 
or more of addition, insertion, deletion or substitution of one 
5 or more nucleotides in the nucleic acid, leading to the addition, 
insertion, deletion or substitution of one or more amino acids in 
the encoded polypeptide. Desirable mutation may be random or 
site directed mutagenesis in order to alter the activity (e.g. 
specificity) or stability of the encoded polypeptide. Changes 

10 may be by way of conservative variation, i.e. substitution of one 
hydrophobic residue such as isoleucine, valine, leucine or 
methionine for another, or the substitution of one polar residue 
for another, such as arginine for lysine, glutamic for aspartic 
acid, or glutamine for asparagine. Also included are variants 

15 having non-conservative substitutions. In regions which are 
critical in determining the peptides conformation or activity 
such changes may confer advantageous properties on the 
polypeptide e.g. altered stability or specificity. 



20 Similarity or homology in the case of variants is preferably 
established via sequence comparisons made using FAST A and FAS TP 
(see Pearson & Lipman, 1988. Methods in Enzymology 183: 63-98). 
Parameters are preferably set, using the default matrix, as 
follows : 

25 

Gapopen (penalty for the first residue in a gap) : -12 for 
proteins / -16 for DNA 

Gapext (penalty for additional residues in a gap) : -2 for 
proteins / -4 for DNA 
30 KTUP word length: 2 for proteins / 6 for DNA. 



Homology may be at the nucleotide sequence and/or encoded amino 
acid sequence level. Preferably, the nucleic acid and/or amino 
acid sequence shares at least about 75%, or 80% identity, most 
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preferably at least about 90%, 95%, 96%, 97%, 98% or 99% 
identity . 



Homology may also be assessed by use of a probing methodology 
5 (Sambrook et al., 1989). One common formula for calculating the 
stringency conditions required to achieve hybridization between 
nucleic acid molecules of a specified sequence homology is: T m = 
81.5°C + 16.6Log [Na + ] + 0.41 (% G+C) - 0.63 (% formamide) - 
600/#bp in duplex. As an illustration of the above formula, 
10 using [Na+] - [0.368] and '50-% formamide, with GC content of 42% 
and an average probe size of 200 bases, the T m is 57°C. The T m of 

* 

a DNA duplex decreases, by 1 - 1 . 5°C with every 1% decrease in 
homology. Thus, targets with greater than about 75% sequence 
identity would be observed using a hybridization temperature of 
15 42°C. 

Use In moss plants 

As described below, in its various aspects, the invention will 
20 generally be employed on moss protoplasts, using nucleic acids 
encoding proteins of interest. 

Suitable promoters which operate in moss protoplasts include the 
Cauliflower Mosaic Virus 35S (CaMV 35S) . Other examples are 

25 disclosed at pg 120 of Lindsey & Jones (1989) Plant Biotechnology 
in Agriculture" Pub. OU Press, Milton Keynes, UK. The promoter 
may be selected to include one or more sequence motifs or 
elements conferring developmental and/or tissue-specific 
regulatory control of expression. Inducible plant promoters 

30 include the ethanol induced promoter of Caddick et al (1998) 
Nature Biotechnology 16: 177-180. 



A terminator is contemplated as a DNA sequence at the end of a 
transcriptional unit which signals termination of transcription. 
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These elements are 3 ' -non-translated sequences containing 
polyadenylation signals, which act to cause the addition of 
polyadenylate sequences to the 3 1 end of primary transcripts. For 
expression in plant cells the nopaline synthase transcriptional 
5 terminator (A. Depicker et al . , 1982, J. of Mol. & Applied Gen. 
1:561-573) sequence may serve as a transcriptional termination 
signal, as can the CaMV 35S terminator (Topfer et al . (1987) NAR 
15, 5890) . 

10 If desired, selectable genetic markers may be included in further 
conventional constructs, such as circular plasmids or in further 
linearised DNA constructs that are co-transformed into a moss 
cell of the invention, such as those that confer selectable 
phenotypes such as resistance to antibiotics or herbicides (e.g. 

15 kanamycin, hygromycin, phosphinotricin, chlorsulf uron, 

methotrexate, gentamycin, spectinomycin, imidazolinones and 
glyphosate) . 

The present invention also provides methods comprising the 
20 introduction of such constructs comprising appropriate 
heterologous sequences into a moss plant cell and/or induction of 
expression of a construct of the invention within a moss plant 
cell, by application of a suitable stimulus e.g. an effective 
exogenous inducer. Suitable moss plant cells include the moss 
25 protoplast, and cells comprised in the protonema, such as those 
derived from Physcomltrella patens. 

Nucleic acid can be introduced into moss protoplasts using any 
suitable technology, such as PEG-mediated DNA uptake as herein 
30 described, particle or micropro j ectile bombardment (US 5100792, 
EP-A-444882, EP-A-434616) microinjection (WO 92/09696, WO 
94/00583, EP 331083, EP 175966, Green et al. (1987) Plant Tissue 
and Cell Culture, Academic Press), electroporation (EP 290395, WO 
8706614 Gelvin Debeyser) other forms of direct DNA uptake (DE 
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4005152, WO 9012096, US 4684611), liposome mediated DNA uptake 
(e.g. Freeman et al. Plant Cell Physiol. 29: 1353 (1984)), or the 
vortexing method (e.g. Kindle, PNAS U.S.A. 87: 1228 (1990d) 
Physical methods for the transformation of plant cells are 
5 reviewed in Oard, 1991, Biotech. Adv. 9: 1-11. 

Electroporation, PEG-mediated DNA uptake and direct DNA uptake 
are preferred. Especially preferred is the modified PEG mediated 
DNA uptake procedure as disclosed in the examples herein. 

10 

The particular choice of a transformation technology will be 
determined by its efficiency to transform certain moss species as 
well as the experience and preference of the person practising 
the invention with a particular methodology of choice. It will be 
15 apparent to the skilled person that the particular choice of a 
transformation system to introduce nucleic acid into moss 
protoplasts is not essential to the invention. However, the use 
of the PEG-mediated DNA transformation system as described herein 
is preferred. 

20 

Thus various aspects of the present invention provide a method of 
transforming a moss protoplast involving introduction of a 
heterologous nucleic acid-based construct of the invention as 
described herein into a moss protoplast and regeneration of the 

25 protoplast into protonema tissue and causing or allowing 
expression of protein from the constructs of the invention. Thus, 
the skilled addressee may expect that expression of protein 
targeted to the cytosol or other cellular compartments can be 
improved by using constructs and methods of the invention. 

30 Preferably, recombinant proteins produced by the methods of the 
invention are secreted into the medium from stably transformed 
protonemal tissue . 

Thus, by employing the at least two constructs of the invention 
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as hereindescribed production lines may be generated harbouring 
high copy numbers of the target gene which in turn results in 
high protein yields over the cultivation period in a suitable 
bioreactor . 

5 

Choice of genes to enhance 

Genes of interest include those encoding proteins which are 
themselves, natural medicaments such as pharmaceuticals or 
10 veterinary products. 

Heterologous nucleic acids may encode, inter alia, genes of 
bacterial, fungal, plant or animal origin. Polypeptides produced 
may be utilised for producing polypeptides which can be purified 

15 therefrom for use elsewhere. Such proteins include, but are not 
limited to retinoblastoma protein, p53, angiostatin, and leptin. 
Likewise, the methods of the invention can be used to produce 
mammalian regulatory proteins. Other sequences of interest 
include proteins, hormones, such as follicle stimulating hormone, 

20 growth factors, cytokines, serum albumin, haemoglobin, collagen, 

thaumatin, thaumatin-like proteins, epidermal growth factors such 
as VEGF, heterodimers , antibodies, immunoglobulins, fusion 
antibodies and single chain antibodies. 

25 Expression of target genes 

Generally speaking, heterologous nucleic acids may be expressed 
by any appropriate process used in the art or they may be 
transcribed or expressed as follows: 

30 

(i) expression of 'naked 1 DNA e.g. comprising a promoter operably 
linked to the heterologous sequence in a construct of the 
invention, 



1061WO 15 

(ii) expression from an expression vector, such as a replicating 
vector. Generally speaking, those skilled in the art are well 
able to construct vectors and design protocols for recombinant 
gene expression. Suitable vectors can be chosen or constructed, 
5 containing appropriate regulatory sequences, including promoter 
sequences, terminator fragments, polyadenylation sequences, 
enhancer sequences, marker genes and other sequences as 
appropriate. For further details see, for example, Molecular 
Cloning: a Laboratory Manual: 2nd edition, Sambrook et al, 1989, 
10 Cold Spring Harbor Laboratory Press or Current Protocols in 

Molecular Biology, Second Edition, Ausubel et al. eds . , John 
Wiley & Sons, 1992. 

As discussed above, the present inventors show that enhanced 
expression from constructs of the invention introduced 
15 (preferably at high levels) into the protoplasts of a moss, 
preferably at high cell density, such as Physcomitrella patens, 
which constructs are integrated into the genome give rise to 
transcribed mRNA. 

20 Thus in one aspect of the invention there is disclosed use of a 
transformed moss protoplast capable of generating mRNA encoding a 
target protein generated by transcription from an introduced 
nucleic acid construct of the invention including the target 
nucleotide sequence operably linked to a promoter, which 

25 construct is introduced into the cell of an organism. 

The "introduced nucleic acid" will thus include the heterologous 
nucleic acid sequence as a DNA sequence provided in the form of a 
construct of the invention that is capable of giving rise to the 
30 production of extracellular protein at an elevated level relative 
to the level of protein production normally associated with 
stable transgene expression of the said DNA sequence. In one 
aspect of the invention, the heterologous nucleic acid sequence 
may encode a protein that is made up of a signal and/or a transit 
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peptide coupled to the protein or polypeptide sequence of choice. 

The reporter can be any detectable protein, such as a marker 
gene, commonly used in the art such as GUS, GFP, luciferase etc. 
5 Preferably, the reporter is a non-invasive marker such as GFP or 
luciferase . 

Naturally, the man skilled in the art will recognise that more 
than one heterologous nucleic acid sequence may be used in the, 

10 or each, construct of the invention, although a single sequence 
in each case is preferred. Multiple vectors (each including one 
or more nucleotide sequences encoding heterologous protein of 
choice) may be introduced into the moss protoplasts via PEG- 
mediated DNA uptake methods as described herein. This may be 

15 useful for producing e.g. multiple subunits e.g. of an enzyme. 

In a further embodiment of the invention high levels of fully and 
correct assembled proteins consisting of multiple subunits can be 
achieved by influencing the stoichiometry of the different coding 

20 nucleic acid sequences integrated into the genome . 

The amount of proper assembled protein that consists of multiple 
subunits is dependent on the stoichiometry of the subunits on the 
protein level. In the case of subunits which have to be targeted 
to different compartments via signal peptides e.g. to the 

25 secretory pathway, the stoichiometry is not only influenced by 
the expression derived from e.g. promoter and transcriptional 
signals but also by the targeting signal and processing of 
targeting signal, e.g. proper cleavage of the signal peptide. In 
this aspect of the invention use of non-equimolar quantities of 

30 the nucleic acid sequences coding for the different subunits may 
be appropriate for multimeric proteins, e.g. for immunglobulins . 
Non-equimolar quantities of coding nucleic acids resulting in 
proper stoichiometry of multiple subunits of a dimeric or 
multimeric protein can thus be achieved by providing 



m 



1061WO 17 

appropriately designed constructs of the invention that enable 
correct assembly of the different subunits . 

As described in the Examples below, expression of heterologous 
5 sequences using methods of the invention when introduced in this 
way can give very high levels of target polypeptide over the 
course of the expression period, which will generally be several 
days, depending on the precise methods and materials employed. By 
using the methods of the invention as herein described, high 

10 levels of heterologous polypeptide production from stably 

incorporated constructs of the invention from regenerated 
transformed, preferably co-transformed protonema can be achieved. 
All references discussed herein, inasmuch as they may be required 
to supplement the present disclosure, are incorporated herein in 

15 their entirety by reference. 

The invention will now be further described with reference to the 
following non-limiting Figures and Examples. Other embodiments 
of the invention will occur to those skilled in the art in the 
20 light of these. 

EXAMPLES 

Methods and Materials 

25 

Plant material 

The wild-type strain of Physcomitrella patens (Hedw.) B.S.G. 
(Reski et al . 1994) is used. It is a subculture of strain 16/14 
30 which was collected by H.L.K. Whitehouse in Gransden Wood, 
Huntingdonshire, UK and propagated by Engel (1968) . 

Construction of vectors 



35 Construction of pRTlOlVEGF C3 
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Human vascular endothelial growth factor 121 (VEGF 12 i) cDNA 
without leader sequence is excised as an Ndel-Sall fragment from 
pCYTEXP-VEGF 12 i (GBF, Braunschweig, Germany) . This fragment is 
5 blunted by the Klenow reaction and introduced into pRTlOl (Topfer 
et al. 1987) at the Smal restriction site to form plasmid 
pRTlOlVEGF C3. In this construct, the VEGF 12 i cDNA minus leader 
sequence was placed downstream of the CaMV 35 S promoter and 
behind the CaMV terminator (Gorr, 1999) . 

10 

Construction of pRTlOlTPVEGF C3 

15 The sequence for VEGF ' signal peptide (sorting signal for 
secretion) is cloned into pRTlOlVEGF C3. The signal peptide cDNA 
is amplified from the plasmid pRTlOl P21 (Gorr, 1999) using the 5 ? 
primer MoB32 3 (5'- ATA CTC GAG GAA GAT GAA CTT TTC TGC CTG TCT 
TGG -3 1 , SEQ ID NO 1) containing an Xhol restriction side and 3 1 

20 primer MoB349 (5 1 - CTG CCA TGG GTG CAG CCT GGG ACC AC -3 ' , SEQ 
ID NO 2) containing iVcol restriction side. The amplified DNA is 
digested with Xhol and iVcol and ligated into pRTlOlVEGF C3 (Xhol 
/ Ncol digested) resulting in pRTlOlTPVEGF C3 . The resulting 
plasmid contains the coding sequences for the VEGF signal peptide 

25 and VEGF 12 i in frame under control of the CaMV 35 S promoter. 

Cloning procedure for 5' first recombination sequence into pRT99 

30 The 250 bp 5' sequence of the 5 th intron: (5'- 

GCGGAAATGTTCAGAGTTAAGCGAAATCACAACTAAAAGAGATTGGAAGCAGAAGAATT 
TTTGAGCAGCTGTTCTTAATTCACGCAACGACAACGCTATTAACTGTATGTGTAGACGAT 
GCACTTTCGTACTGAAGGGATCTAAATTTATTATATCCCTTCATAACTAGAGGCAAGGCG 
GAAATCACAAAACTATTGGTACCTACGTACTACAGCCTCCAGGATCAAACATAAGAGTGA 
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AACACTGGACC -3', SEQ ID NO 3) of the alpha 1 , 3-f ucosyltransf erase 
gene of Physcomitrella patens is amplified from genomic DNA of 
Physcomitrella patens by Pfu-proof -reading PCR (Promega, Germany) 
using the upstream primer Recl_SalI_SacII (5 ' -GAG GTC GAC CCG CGG 
5 AAA TGT TCA GAG- 3 1 , SEQ ID NO 4) and the downstream primer 
Recl_SmaI (S'-CTC CCC GGG TCC AGT GTT TCA CTC-3', SEQ ID NO 5). 
After restriction of the resulting amplification product with 
Sail and Smal it is cloned into the vector pRT99 (Topfer et al . 
1988) (Sail and Smal digested) . The resulting plasmid pRT99Recl 
10 contains the 5 f first recombination sequence. 



Cloning procedure for the 3 1 second recombination sequence into 
15 pRT99Recl 

The 208 bp 3 f sequence of the 5 th intron: (5'- 

GGGACCCAAGCGTAAGAAGTCTTATGAAAAAGTTACCTCACAGATTAAAACTAAACATAGGA 
AAATACCAATGCACTCCAATGTGTCAATGAGATTAACGCTTGACTAACATGAAAATATAA 

20 ATATTCACCGAATGAAAGAAATTAGAAAACAGGACCTGTAGATTGTAAGAGATAGATTCT 

TGAGTTAGAAACACAAATGATTGTCC -3 1 , SEQ ID NO 6) of the alpha 1,3- 
f ucosyltransf erase gene of Physcomitrella patens is amplified 
from genomic DNA of Physcomitrella patens by Pfu-proof-reading 
PCR (Promega, Germany) using the upstream primer Recll__SmaI ( 5 1 - 

25 GAG CCC GGG ACC CAA GCG TAA GAA G-3 ' , SEQ ID NO 7) and the 
downstream primer Recll_SacI I_SsTI I (5'-TCT GAG CTC CCG CGG ACA 
ATC ATT TGT GTT TC-3 1 , SEQ ID NO 8). After restriction of the 
resulting amplification product with Smal and Sstl it is cloned 
into the vector pRT99Recl (Smal and Sstl digested) . The resulting 

30 plasmid pRT99Recll contains the 5' first and the 3 1 second 
recombination sequence . 

Construction of pRT99TPVEGFRecl 

35 The expression cassette containing the CaMV 35S promoter, 
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TPVEGF12 1 and CaMV 35S terminator is excised as a PstI fragment 
from pRTlOlTPVEGF C3 . This fragment is blunted by the Klenow 
reaction and introduced into the Smal digested and 
dephosphorylated plasmid pRT99Recll resulting in the plasmid 
5 pRT99TPVEGFRecl . 



Cloning procedure for 5J second recombination sequence into pRT99 

10 

The 208 bp 3 1 sequence of the 5 th intron: (5*- 
GGGACCCAAGCGTAAGAAGTCTTATGAAAAAGTTACCTCACAGATTAAAACTAAACATAGGA 
AAATACCAATGCACTCCAATGTGTCAATGAGATTAACGCTTGACTAACATGAAAATATAA 
ATATTCACCGAATGAAAGAAATTAGAAAACAGGACCTGTAGATTGTAAGAGATAGATTCT 

15 T GAG T T AG AAAC AC AAAT GAT T GT C C -3 ' , SEQ ID NO 6) of the alpha 1,3- 
fucosyltransf erase gene of Physcomitrella patens is amplified 
from genomic DNA of Physcomitrella patens by Pfu-proof -reading 
PCR (Promega, Germany) using the upstream primer 

Rec2_SalI_SacII (5 ' -GAG GTC GAC CCG CGG ACC CAA GCG TAA GAA G-3 ' , 

20 SEQ ID NO 9) and the downstream primer Rec2_SmaI (5 f -TCT CCC GGG 
ACA ATC ATT TGT GTT TC-3 1 , SEQ ID NO 10). After restriction of 
the resulting amplification product with Sail and Smal it is 
cloned into the vector pRT99 (Topfer et al . 1988) (Sail and Smal 
digested). The resulting plasmid pRT99Rec2 contains the 5 1 second 

25 recombination sequence. 

Cloning procedure for 3 ' first recombination sequence into 
pRT99Rec2 

30 The 2 50 bp 5 1 sequence of the 5 th intron: (5 1 - 
GC GG AAAT GT T C AGAGT T AAGC G AAAT C AC AACT AAAAG AG AT T GG AAG C AG AAGAATT 
TTTGAGCAGCTGTTCTTAATTCACGCAACGACAACGCTATTAACTGTATGTGTAGACGAT 
GCACTTTCGTACTGAAGGGATCTAAATTTATTATATCCCTTCATAACTAGAGGCAAGGCG 
GAAATCACAAAACTATTGGTACCTACGTACTACAGCCTCCAGGATCAAACATAAGAGTGA 

35 AACACTGGACC -3 1 , SEQ ID NO 3) of the alpha 1 , 3-f ucosyltransf erase 
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gene of Physcomitrella patens is amplified from genomic DNA of 
Physcomltrella patens by Pfu-proof -reading PCR (Promega, Germany) 
using the upstream primer Rec22_SmaI ( 5 1 -GAG CCC GGG AAA TGT TCA 
GAG TTA AGC G-3', SEQ ID NO 11) and the downstream primer 
5 Rec22_SacII_SstI ( 5 1 -TCT GAG CTC CCG CGG TCC AGT GTT TCA CTC TTA 
TG-3 1 , SEQ ID NO 12). After restriction of the resulting 
amplification product with Smal and SstI it is cloned into the 
vector pRT99Rec2 (Smal and SstI digested) . The resulting plasmid 
pRT99Rec22 contains the 5 1 second and the 3' first recombination 
10 sequences . 

Construction of pRT99TPVEGFRec2 

15 The expression cassette containing CaMV 35S promoter, TPVEGF 12 i 
and CaMV 35S terminator is excised as a PstI fragment from 
pRTlOlTPVEGF C3 . This fragment is blunted by the Klenow reaction 
and introduced into the Smal digested and dephosphorylated 
plasmid pRT99Rec22 resulting in the plasmid pRT99TPVEGFRec2 . 

20 Restriction of pRT9 9TPVEGFRecl and Rec2 with SacII or Sail and 
SstI results in linearisation of the first and the second 
heterologous nucleic acid sequences comprising the recombination 
sequences and the heterologous nucleic acid sequences of interest 
comprising a promoter operably linked thereto. The linearised 

25 heterologous nucleic acid sequences are used for transformation 
of moss cells. 

Standard culture conditions 



30 Plants are grown axenically under sterile conditions in plain 

inorganic liquid modified Knop medium (1000 mg/1 Ca(N0 3 ) 2 x 4H 2 0 
250 mg/1 KC1, 250 mg/1 KH 2 P04, 250 mg/1 MgS0 4 x 7 H 2 0 and 12.5 



mg/1 FeS0 4 X 7 H 2 0; pH 5.8 (Reski and Abel 1985)). Plants are 
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grown in 500 ml Erlenmeyer flasks containing 200 ml of culture 
medium and the flasks are shaken on a Certomat R shaker (B.Braun 
Biotech International, Germany) set at 120 rpm. Conditions in the 
growth chamber are 25 + /- 3°C and a light :dark regime of 16:8 h. 
5 The flasks are illuminated from above by two fluorescent tubes 
(Osram L 58 W / 25) providing 35 umols _1 m" 2 . The cultures are sub- 
cultured once a week via disintegration using an Ultra-Turrax 
homogenizer (IKA, Staufen, Germany) and inoculation of two new 
500 ml Erlenmeyer flasks containing 100 ml fresh Knop medium. 

10 

Protoplast isolation 

15 Pre-culture of moss tissue for optimal protoplast isolation. 

Mosses (especially Physcomitrella patens) can be pre-cultured 
under different conditions to obtain optimal protoplast yields: 

I. Rother et al . 1994 cultivated moss tissue for 7 days in Knop 
20 medium with reduced (10%) Ca(N0 3 ) 2 content. Cultures are filtered 

3 or 4 days after disintegration and are transferred into fresh 
Knop medium with reduced (10%) Ca(N0 3 ) 2 content. 

II. Instead of reduction of Ca(N0 3 ) 2 the medium for pre-culture 
can be supplemented with 5 mM ammonium tartrate or the pH can be 

25 altered to 4.5 (in liquid cultures with uncontrolled pH-values an 
average pH of 5.8 is reached for modified Knop medium). Cultures 
are filtered 3 or 4 days after disaggregation of tissue and are 
transferred into fresh modified Knop medium (supplemented with 5 
mM ammonium tartrate or altered to pH 4.5). 

30 

III. Hohe and Reski (2002) optimised culture conditions in a 
semi-continuous bioreactor to obtain high yields of protoplasts. 
Isolated protoplasts of high yields are obtained either by 
supplementation of modified Knop medium (Reski and Abel 1985) 

35 with 460 mg/1 ammonium tartrate or under controlled pH-values 
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with a setpoint of 4.5 (in bioreactor cultures with uncontrolled 
pH-values an average pH of 5.8 is reached for modified Knop 
medium) . 

5 

Different protocols for the isolation of protoplasts (Grimsley et 
al. 1977; Schaefer et al . 1991; Rother et al . 1994; Zeidler et 
al. 1999; Hohe and Reski 2002, Protocol Schaefer 2001) and for 
10 transformation (Schaefer et al . 1991; Reutter and Reski 1996, 
Protocol Schaefer 2001) have been described for Physcomltrella 
pa tens . 

For the work presented herein, a modification/combination of the 
previously described methods is used: 

15 

After filtration the moss protonemata are preincubated in 0.5 M 
mannitol. After 30 min, 4 % Driselase (Sigma, Deisenhofen, 
Germany) is added to the suspension. Driselase is dissolved in 
0.5 M mannitol (pH 5.6-5.8), centrifuged at 3600 rpm for 10 min 

20 and sterilised by passage through a 0.22 um filter (Millex GP, 
Millipore Corporation, USA) . The suspension, containing 1% 
Driselase (final concentration) , is incubated in the dark at RT 
and agitated gently (best yields of protoplasts are achieved 
after 2 hours of incubation) (Protocol Schaefer 2001). The 

25 suspension is passed through sieves (Wilson, CLF, Germany) with 
pore sizes of 100 um and 50 um. The suspension is centrifuged in 
sterile centrifuge tubes and protoplasts are sedimented at RT for 
10 min at 55 g (acceleration of 3; slow down at 3; Multifuge 3 S- 
R, Kendro, Germany) (Protocol Schaefer 2001) . Protoplasts are 

30 gently re-suspended in W5 medium (125 mM CaCl 2 x 2H 2 0; 137 mM 
NaCl; 5.5 mM glucose; 10 mM KC1; pH 5.6; 660-680 mOsm; sterile 
filtered) . The suspension is centrifuged again at RT for 10 min 
at 55 g (acceleration of 3; slow down at 3; Multifuge 3 S-R, 
Kendro, Germany) . Protoplasts are gently re-suspended in W5 

35 medium (Rother et al . 1994). For counting protoplasts a small 



3 
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volume of the suspension is transferred to a Fuchs-Rosenthal- 
chamber . 

Transformation protocol 

5 

For transformation protoplasts are incubated on ice in the dark 
for 30 minutes. Subsequently, protoplasts are sedimented by 
centrif ugation at RT for 10 min at 55 g (acceleration of 3; slow 
down at 3; Multifuge 3 S-R, Kendro) . Protoplasts are re-suspended 

10 in 3M medium (15 mM CaCl 2 x 2H 2 0; 0.1 % MES; 0.48 M mannitol; pH 
5.6; 540 mOsm; sterile filtered, Schaefer et al . 1991) at a 
concentration of 1.2 x 10 6 protoplasts / ml (Reutter and Reski 
1996) . 250 ul of this protoplast suspension are dispensed into a 
new sterile centrifuge tube, 50 ul DNA solution of both 

15 constructs, pRT99TPVEGFRecl and pRT99VEGFRec2 , and the vector 
containing the selection marker (column purified DNA in H 2 0 
(Qiagen, Hilden, Germany) ; 10-100 ul; DNA amount of 30 ug per 
construct; 10 ug of the vector containing the selection marker) 
is added and finally 250 ul PEG-solution (40% PEG 4000; 0.4 M 

20 mannitol; 0.1 M Ca(N0 3 ) 2 ; pH 6 after autoclaving) is added. The 
suspension is immediately but gently mixed and then incubated for 
6 min at RT with occasional gentle mixing. The suspension is 
diluted progressively by adding 1, 2, 3 and 4 ml of 3M medium. 
The suspension is centrif uged at 20 °C for 10 minutes at 55 g 

25 (acceleration of 3; slow down at 3; Multifuge 3 S-R, Kendro) . The 
pellet is re-suspended in 3 ml regeneration medium. Selection 
procedure is performed as described by Strepp et al. (1998) . 

30 DNA analysis 

DNA analysis of stably transformed plants is performed as 
described by Strepp et al . (1998) . Estimation of copy number is 
performed by Southern blot analysis and comparison to a stably 
35 transformed plant containing one copy of the heterologous DNA. 
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Quantification of recombinant VEGF 12 i 

Recombinant VEGF 12 i expressed by stably transformed moss plants is 
quantified by ELISA (R&D Systems, Wiesbaden, Germany) . The ELISA 
is performed according to the instructions of the manufacturer. 
The samples can be diluted for quantification. 



For stably transformed plants the estimation of high copy numbers 
15 of integrated constructs correlates with high yields of 
recombinant protein. 
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